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4 <110> APPLICANT: Boehringer Ingelheim (Canada) Ltd. 

6 <120> TITLE OF INVENTION: Purified Active HCV NS2/3 Protease 

9 <130> FILE REFERENCE: 13/082 

C--> 11 <140> CURRENT APPLICATION NUMBER: US/10/017 , 736 

C--> 11 <141> CURRENT FILING DATE: 2001-12-14 

11 <150> PRIOR APPLICATION NUMBER: 60/256,031 

12 <151> PRIOR FILING DATE: 2000-12-15 
14 <160> NUMBER OF SEQ ID NOS : 21 

16 <170> SOFTWARE: FastSEQ for Windows Version 4.0 



20 <212> TYPE: DNA 

21 <213> ORGANISM: HCV 

23 <220> FEATURE: 

24 <221> NAME/KEY: CDS 

25 <222> LOCATION: (1)...(1230) 

27 <400> SEQUENCE: 1 

28 atg gac egg gag atg get gca teg tgc gga ggc gcg gtt ttc ata ggt 4 8 

29 Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe lie Gly 

30 1 5 10 15 

32 ctt gca etc ttg acc ttg tea. cca tac tat aaa gtg etc etc get agg 96 

33 Leu Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Leu Leu Ala Arg 

34 20 25 30 

36 etc ata tgg tgg tta cag tat tta ate acc aga gtc gag gcg cac ttg 144 

37 Leu lie Trp Trp Leu Gin Tyr Leu lie Thr Arg Val Glu Ala His Leu 

38 35 40 45 

40 caa gtg tgg ate ccc cct etc aat gtt egg gga ggc cgc gat gec ate 192 

41 Gin Val Trp lie Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala lie 

42 50 55 60 

44 ate etc etc acg tgc gca gtc cac cca gag eta ate ttt gac ate acc 240 

45 lie Leu Leu Thr Cys Ala Val His Pro Glu Leu lie Phe Asp lie Thr 

46 65 70 75 80 

48 aaa etc ctg etc gee ata ttc ggt ccg etc atg gtg etc cag gca ggc 288 

49 Lys Leu Leu Leu Ala lie Phe Gly Pro Leu Met Val Leu Gin Ala Gly 

50 85 90 95 

52 ata acc aaa gtg ccg tac ttc gtg cgt gcg cag ggg etc att cgt gcg 336 

53 lie Thr Lys Val Pro Tyr Phe Val Arg Ala Gin Gly Leu lie Arg Ala 

54 100 105 110 

56 tgt atg ttg gtg egg aag get gcg ggg ggt cat tat gtc caa atg gee 384 

57 Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gin Met Ala 

58 115 120 125 

60 ttc atg aag eta get gcg ctg aca ggt acg tac gtt tat gac cat etc 432 

61 Phe Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu 

62 130 135 ~ 140 

64 act cca ttg cag gat tgg gee cac gcg ggc eta cga gac ctt gca gtg 480 
6 5 Thr Pro Leu Gin Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val 
66 145 150 155 160 



18 <210> 

19 <211> 



SEQ ID NO: 1 
LENGTH: 1230 



ENTERED 
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ttg 


cag 


gat 


tgg 


gcc 


cae 


gcg 


ggc 


L Ld 


cga 


gac 


L L U 


gca 


Z ft u 


990 

Z» Z> V/ 


Leu 


Thr 


Pro 


Leu 


Gin 


Asp 


Trp 


Ala 


nio 


Al ^ 


VJiy 


XiC u 


A TTT 


Aon 


iJC u. 


Al Pi 
/Aid, 




991 

Z Z X 


65 










70 










7 R 














9 9 7 


gtg 


gcg 


gta 


gag 


ccc 


gtc 


ate 


ttc 


+- «+- 
LCU 


gac 


a +■ rr 

aty 


gag 


gtc 


aag 


a lc 


dLC 


9 ft ft 

zoo 


994 

Z ^ *i 


Val 


Ala 


Val 


Glu 


Pro 


Val 


He 


Phe 


OCX 


Aon 


Mot 

lit: L- 


rin 

ulU 


Va 1 
V dX 


T tr c 

Xiy o 


Tl 
lit: 


Tl <=» 
lie 




9 9 R 

ZZ J 










85 










Qn 
















997 
z z / 


acc 


tgg 


ggg 


gcg 


gac 


acc 


gcg 


gca 


tgc 


ggg 


ga c 


ai-p 
dtC 


-a 4- +- 

alt 


LCd 


ggt 


ctg 


J JO 


228 


Thr 


Trp 


Gly 


Ala 


Asp 


Thr 


Ala 


Ala 


Cys 


Gly 


Sen 


Tl <a> 

11C 


Tl P 

11C 


Cp-r 
O CI 


vJ X Jr 


Leu 




9 9 Q 

Z Z .7 








100 










1 OS 

X U O 










1 1 n 

X X \J 








9 71 
z OX 


ccc 


gtc 


tec 


get 


cga 


agg 


gga 


agg 


gag 


d Ld 


etc 


ctg 


gga 


ccg 


gcc 




7 ft A 
O O ft 


232 


Pro 


Val 


Ser 


Ala 


Arg 


Arg 


Gly 


Arg 


Glu 


lie 


Leu 


Leu 


Glv 
oxy 


Pro 


Ala 


Asp 




233 






115 










120 










125 










235 


aat 


ttt 


gaa 


ggg 


cag 


ggg 


tgg 


cga 


etc 


ctt 


gcg 


ccc 


ate 


acg 


gcc 


tac 


432 


236 


Asn 


Phe 


Glu 


Gly 


Gin 


Gly 


Trp 


Arg 


Leu 


Leu 


Ala 


Pro 


He 


Thr 


Ala 


Tyr 




237 




130 










135 










140 












239 


tec 


caa 


cag 


aca 


egg 


ggc 


eta 


ctt 


ggt 


tgc 


ate 


ate 


acc 


age 


etc 


aca 


480 


240 


Ser 


Gin 


Gin 


Thr 


Arg 


Gly 


Leu 


Leu 


Gly 


Cys 


He 


He 


Thr 


Ser 


Leu 


Thr 




241 


145 










150 










155 










160 




243 


ggc 


egg 


gac 


aag 


aac 


cag 


gtc 


gag 


ggg 


gag 


gtt 


caa 


gtg 


gtc 


tec 


acc 


528 


244 


Gly 


Arg 


Asp 


Lys 


Asn 


Gin 


Val 


Glu 


Gly 


Glu 


Val 


Gin 


Val 


Val 


Ser 


Thr 




245 










165 










170 










175 






247 


get 


aca 


caa 


tct 


ttc 


ctg 


gcg 


acc 


tgc 


gtc 


aac 


ggc 


gtg 


tgt 


tgg 


act 


576 
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a ** \j 


Ala 


Thr 


Gin 


Ser 


Phe 


Leu 


Ala 


Thr 




Val 


As n 




V CL X 


Cys 


Trrj 
xxp 


Thr 




OAQ 








J. O V 










1 ft R 










1 QO 








9 R1 

Z Jl 


5 1 c 


+ + C 


td L. 




gec 


99 c 




aag 


ace 


rr cr 


gec 


ggc 


ccc 


aaa 


ggc 


c r* 

^ t— cl 


9 A 


-> 


Val 


Phe 


His 


v 


Ala 




Ser 




X I IX 


XI C- Li 


AT a 




Pro 


T,vc 
xj y o 


VJX Jf 


Pro 




9 






195 










900 










90S 

z O J 










^ j j 


o +- r 1 


acc 


cag 


ci l. y 


Let O 




acL L. 


cri~ cr 

gtg 


gac 


cag 


gac 


ClC 


gtc 


ggc 


4- ctct 


t-ciy 


679 
o / z 


256 


lie 


Thr 


Gin 


Met 




Thr 


Asn 


Val 


Asp 


Gin 


Asp 


Leu 


Val 


Glv 


Trrj 


Gin 




9 




210 










91 R 

Z J — > 










990 

Z Z \J 














gcg 


ccc 


cct 


9" 9" 9" 


gcg 


cgc 


tec 


dLy 


aca 


cca 


tgc 


ace 


tgc 


ggc 


age 


teg 


790 
/ z u 


z u u 


Ala 


Pro 


Pro 


Gly 


Ala 


ax y 


Ser 


Met 


X 1 IX 


Pro 
i x w 




Thr 

X I IX 


Cys 


\j J- _y 


OCX 


Ser 




9 fi 1 

Z U _L 


99 s 

Z Z J 










Z Jw 










9 ^ R 

Z J — J 










940 




9 G ^ 


gac 


etc 


tat 




gtc 


acg 


aga 




gee 


gac 


gtc 


aUL 


ccg 


gtg 


cgc 


egg 


/ o o 


964 

Z O *r 




Leu 


Tyr 


J_lG LI 


V Ct J. 


rp V-i -*~ 
X I1X 


A >*rr 

Arg 


n x o 


AX a 


A 

nop 


V Ci X 


lie 


tX u 


V Cl X 


A ttt 
ax y 


A T*fT 




9 ^ 

Z 0 D 










9 A R 
Z ft O 










9^0 
z DKJ 










ZOO 






O £ 7 
ZD/ 


egg 


ggc 


gac 


ay L 


agg 




age 


ctg 


etc 


tec 


ccc 


agg 


cct 


gtc 


tec 


fa/, 

L.O.C 


PI 6 
o x D 


zoo 




Gly Asp 


Cp-r 






OCX 


J_lG u. 


JjC IX 


OCX 


Prn 
IT X LJ 


Arg 


Prn 

IT X 


V Ct X 


Cpr 
OCX 


Tvr 




Z O J7 








Z O v 










9fi S 

Z O yj 










970 

Z / \J 








271 


ttg 


aag 


ggc 


tct 


teg 


ggt 


ggc 


cca 


ctg 


etc 


tgc 


cct 


teg 


ggg 


cac 


get 


864 


272 


Leu 


Lys 


Gly 


Ser 


Ser 


Gly 


Gly 


Pro 


Leu 


Leu 


Cys 


Pro 


Ser 


Gly 


His 


Ala 




273 






275 










280 










285 










275 


gtg 


ggc 


ate 


ttc 


egg 


get 


get 


gtg 


tgc 


acc 


egg 


ggg 


gtt 


gca 


aaa 


gcg 


912 


276 


Val 


Gly 


He 


Phe 


Arg 


Ala 


Ala 


Val 


Cys 


Thr 


Arg 


Gly 


Val 


Ala 


Lys 


Ala 




277 




290 










295 










300 












279 


gtg 


gac 


ttc 


ata 


cct 


gtt 


gag 


tct 


atg 


gaa 


act 


acc 


atg 


egg 


act 


agt 


960 


280 


Val 


Asp 


Phe 


He 


Pro 


Val 


Glu 


Ser 


Met 


Glu 


Thr 


Thr 


Met 


Arg 


Thr 


Ser 




281 


305 










310 










315 










320 




283 


age 


get 


tgg 


cgt 


cac 


ccg 


cag 


ttc 


ggt 


ggt 


aaa 


aag 


aaa 


aag 


taa 




100 


284 


Ser 


Ala 


Trp 


Arg 


His 


Pro 


Gin 


Phe 


Gly 


Gly 


Lys 


Lys 


Lys 


Lys 


* 







285 325 330 

287 ggatcc 1011 

289 <210> SEQ ID NO : 4 

290 <211> LENGTH: 334 

291 <212> TYPE: PRT 

292 <213> ORGANISM: HCV 
294 <400> SEQUENCE: 4 



295 


Met 


Lys 


Lys 


Lys 


Lys 


Leu 


Glu 


His 


His 


His 


His 


His 


His 


Thr 


Ser 


Ala 


296 


1 








5 










10 










15 




297 


Gly 


He 


Thr 


Lys 


Val 


Pro 


Tyr 


Phe 


Val 


Arg 


Ala 


Gin 


Gly 


Leu 


He 


Arg 


298 








20 










25 










30 






299 


Ala 


Cys 


Met 


Leu 


Val 


Arg 


Lys 


Ala 


Ala 


Gly 


Gly 


His 


Tyr 


Val 


Gin 


Met 


300 






35 










40 










45 








301 


Ala 


Phe 


Met 


Lys 


Leu 


Ala 


Ala 


Leu 


Thr 


Gly 


Thr 


Tyr 


Val 


Tyr 


Asp 


His 


302 




50 










55 










60 










303 


Leu 


Thr 


Pro 


Leu 


Gin 


Asp 


Trp 


Ala 


His 


Ala 


Gly 


Leu 


Arg 


Asp 


Leu 


Ala 


304 


65 










70 










75 










80 


305 


Val 


Ala 


Val 


Glu 


Pro 


Val 


He 


Phe 


Ser 


Asp 


Met 


Glu 


Val 


Lys 


He 


He 


306 










85 










90 










95 




307 


Thr 


Trp 


Gly 


Ala 


Asp 


Thr 


Ala 


Ala 


Cys 


Gly 


Asp 


He 


He 


Ser 


Gly 


Leu 


308 








100 










105 










110 







Use of n and/or Xaa has been detected to the Sequence IMm 

Review the Sequence Listing to insure a corresponding 

S) explanation is presented in the <220> to <223> fields of 

' each sequence using a or Xaa. ______ — - 
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